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A subset of Statistical Process Control (SPC) 
methodology known as Control Charting is introduced • SPC methodology 
is a collection of graphical and inferential statistics techniques 
used to study the progress of phenomena over time. The types of 
control charts covered are the null X (mean), R (Range), X 
(individual observations) , MR (moving range) , p (measure of 
successes) , and U (number of events per unit) . Uses of charts in the 
binomial case and the Poisson case are considered. These charts 
provide the researcher V7ith a combined graphical and hypothesis 
testing procedure for assessing the stability of various types of 
data collected across time. The details of constructing each of these 
charts and the methods for hypothesis testing are described. Examples 
drawn from the field of .aucation illustrate chart construction. SPC 
procedures described are understandable by researchers who are not 
mathematically oriented; whereas other procedures often recommended 
in the statistically-oriented literature on single-subject design are 
not easily understood by these researchers. All of the SPC techniques 
introduced can be modified to allow the analysis of data from one-way 
multiple group designs and factorial designs. Although these charts 
do not represent all the control charting techniques used in process 
control, they are the ones most applicable to educational research. 
Five tables present sample data, and seven graphs illustrate SPC 
charts. (SLD) 
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ABSTRACT 



The purpose of this paper is to introduce the reader to some of 
the control-charting techniques used in Statistical Process 
Control. The types of control charts covered are the x-, 
MR-, p- and u-charts. These charts provide the researcner with a 
combined graphical/hypothesis testing procedure for assessing the 
stability of various types of data collected across time. The 
paper describes the details of constructing each of these charts 
and the methods for hypothesis testing. Educational examples are, 
given throughout. 
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Introduction 

Statistical Process control (SPC) methodology is a collection 
of graphical and inferential statistics techniques used to study 
the progress of phenomena (e.g., the output of a particular 
machine or a specific behavior for an emotionally impaired 
youngstfir) over time. SPC began in the 1930 's with the 
development of control charting techniques by Shewhart (1931), 
but it has not been until the last 15 years that SPC has become 
popular in the United states, although it has been used 
continuously In some American industries (e.g.. Western Electric) 
since the 1930's. It should be noted that the popular use of SPC 
in Japan began approximately 40 years ago. Further, the use of 
SPC techniques both in Japan and the United states seems to have 
been restricted in the past, as well as the present, almost 
entirely to industrial settings. 

The popularity of the sPC methodology in industrial settings 
can be seen by examining introductory statistics textbooks 
written for business and engineering students. Almost all of 
these textbooks have from a section to one or more chapters 
dealing with SPC techniques. To the best of the author's 
knowledge, statistics textbooks in education, the behavioral 
sciences and the social sciences do not, however, mention these 
techniques at all. This even includes books with chapters on the 
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analysis of longitudinal data. Further, a computer-assisted 
search of the education literature (via the ERIC system) revealed 
only two published articles and no unpublished literature that 
dealt with any aspect of SPC. Each of the articles only dealt 
with a very narrow application of only the same small piece of 
SPC methodology. Hence, there is a need for an expository paper 
that introduces SPC methodology to the educational research 
community. 

The main purpose of this paper is to introduce the reader to a 
large subset of Statistical Process Control methodology known as 
Control Charting, some of the more popular types of control 
charts used presently in Statistical Process Control will be 
described in detail and ways in which these charts might be used 
to analyze educational data will be suggested, where 
appropriate, the theoretical statistical underpinnings for the 
control charts will be discussed. It should be noted up front 
that the author has only recently learned the details of control 
charting. Hence, this , aper is more of a beginning rather than a 
definitive review of the methodology, it is hoped, however, that 
others will still find this paper useful. 

"X- and R-charts 

The most commonly used SPC technique is the combination of 
the X-chart and the R-chart. The data collected for use with 
this technique are of the following form. A constant number, n. 
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of observations, with n ^ 2, are collected on some process at a 
number of time points, k, usually with k k 25. The time points, 
may or may not be, equally spaced. The observations at each time 
point must arise from a well-defined process and must be a sample 
from the total (i.e., population) output of the process. This 
sample may be a simple random, systematic, cluster, or stratified 
sample or arise from some combination of these sampling methods. 
The dependent variable measured on each unit (e.g., a part or a 
child) should have an underlying continuous distribution or, at 
the minimum, should be measured on an interval scale. Some 
examples of data sets where the technique of and R-charting is 
appropriate are: 

1. The weights of or number of pieces in a box of candy. 

2. The thickness of a metallic coating. 

3. Daily/weekly quiz scores for a class. 

4. Weights of people using some behavioral modification 
technique . 

As the data are collected, the mean (denotad X^) and the Range 
(R^) are computed for each sample, i = 1, 2, k. 
After a sufficient number of baseline samples (usually k ^ 25) 
are collected an estimator of the population standard deviation 
is computed based on the R^'s. Although estimating the standard 
deviation using the ranges may seem strange at first, it is 
actually an excellent estimation technique. First, x- and R- 
charts are often compiled by workers on production lines as part 
of their duties each shift. The computing of the range takes a 
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few seconds, while the computing of a standard deviation (even 
with a calculator with statistical capabilities) takes much 
longer. Second, the range can be computed even without a 
calculator, if absolutely necessary, by most production workers. 
In educational settings, as will be discussed more below, one of 
the more obvious applications of control charting is in the 
analysis of data arising from single-subject designs implemented 
in a school or other institutional setting. The data e ^ often 
collected by teachers or other professional personnel to help 
them evaluate the behavior (s) of a student or small group of 
students. For these professionals, computing the range instead 
of the standard deviation advantageous time-wise because of the 
multiple demands of their time. Third, the concept of a range is 
easier for people to understand than is the concept of a 
standard deviation. Besides its simplicity, the use of ranges 
leads to estimators of the population standard deviation that 
have desirable statistical properties. For the following 
discussion and for much of the remainder of this paper, the focus 
will be on the analysis of data collected when no treatments are 
being applied (i.e., when baseline data are being collected). 
The question of how to detect treatment effects will be addressed 
later in this paper. 

Statistical Background 

Let denote the population standard deviation for the 
dependent variable of interest. Construct a new variable, , 



with = Interestingly, when the X^'s are distributed 

independently as normal random variables with identical means and 

variances (i.e., they are i.i.d.), the expected value (i.e., 

population mean) of each of the W^'s is solely a function of the 

sample size, n (Montgomery, 1985). Denote this expected value by 

the symbol d2. So, = E(W^) = E{R^/a^) = E(R^)/c^. Hence, a^= 

E(R^)/d2. Summing over the k samples, a = (E( S R./k))/d,. Let 

R equal the mean of the R 's (i.e., R = s R./k) . Then ay = 

_ _ _ i-1 ^ 

E(R)/d2 = E(R/d2). Hence, R/d2 is an unbiased estimator of a^. 

Some comparison of this estimator to an estimator based on the 

usual least squares estimators, S^ , of c? seems appropriate 

2 " - 
here. Let S^ = (^Sj(x^^- X^))/(n-l) denote the usual least 

squares estimator of oj for sample i. Then S? is an unbiased 

2 A— ^ 

estimator of , but s^ = /s^ is not an unbiased estimator of a^. 

When the X^'s are independent and are distributed with identical 

normal distributions then the bias of each of the 

S^ 's is well known, and an unbiased estimator of is given by 

\ /C/^ , where 

C4 =/L.,2_\^ . r(n/2) : 
\ n-lj r{(n-l)/2) • 

Let S =^S^S^/k. Then S/c^ is an unbiased estimator of . Even 

though R/d2 and s/c^ are both unbiased estimators of a^, R/d2 is 

not as efficient as S/c^. Montgomery (1985, pg. 174) gives a 

table comparing the relative efficiency of R/(^ to s/c^ for 

various values of n. Montgomery (1985) and others also give 

tables of the values of d2 and . None of these textbooks, 

however, gives a formula for or a reference to the origin of 



the table. Table 1 of this paper reproduces the values of 6,2, 
c^, and the relative efficiencies for n < 10, since the relative 
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efficiency of R/d2 (when compared to S/c^ ) is approximately equal 
to 1 for n £ 5, and since it is rare to have samples of size 
n > 5, it is common practice to use R/d2 instead of W/c^ because 
of its simplicity of computation. Hence, an unbiased estimator 
of o^, the mean of population standard deviation of the sampling 
distribution of each of the X 's, is given by a- = R/(d^^/n) and 
it is this estimator that is used when constructing control 
charts • 

Finally, an estimator is needed of a^, where is the common 
population standard deviation of the distributions of the R^^'s. 
The population standard deviation of W^, where = Ri/o^/ is 
solely a function of n (Montgomery, 1985) and is usually denoted 
by the symbol Hence, d3 = VarCW^ ) = Var(Rt/ax) = Var(Ri)/aJ 

~ Solving for a gives, a - d.a . Thus, an unbiased 

estimator of is given by (d3R)/(d2), since dj is a constant 
and R/(d2) is an unbiased estimator of a^. The values of d3 are 
given in last column of Table 1. 

Hypotheses Tested 

The null and alternative hypotheses tested by those using X- 
and R-charts are: 
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Hq: The process is in control (stable) 

H^: The process is NOT in control (NOT stable) 
where *^in control" means that the X^^s are identically 
independently distributed (i.e,, i^i.dO normal and the R^^s are 
i.i.d. also. In educational settings these same hypotheses are 
often of interest with respect to the baseline observations on a 
single-subject or a same group of subject because, if the 
baseline is not sti^ble, then there is no known statistical 
methodology that can always separate treatment effects from what 
would happen naturally in the absence of treatments (Huitema, 
1986). It should be noted that there are, of course, a variety 
of methods other than the methods to be described in this paper 
that have been developed for ascertaining whether a baseline is 
stable (see e.g., Hersen & Barlow, 1976; Huitema, 1986). To test 
the null hypothesis versus the alternative hypothesis, two 
separate charts, the X- and the R-chart are constructed. Once 
the charts are constructed, a series of rules is then applied. 
The construction of these two charts will be discussed next and 
will -be followed by a discussion of the series of rules used for 
hj^othesis testing. It should be pointed out to the reader that 
it is rare to find the idea of hypothesis testing mentioned when 
this series of rules is applied. In reality, however, it is a 
hypothesis test that is being performed. 
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Construction of the X- and R-charts 

Two charts are constructed simultaneously in order to test the 
hypotheses stated above. The first chart, the X chart, is 
constructed by plotting ordered pairs with the sample number 
(i.e., the i's) as the first coordinate and the X^'s as the 
second coordinate. Seven parallel horizontal lines are then put 
on the chart. These lines are: 

(1) The Center Line, X, where 2" represents the grand mean 
^ k 

(i.e., X = ^S^X^/k ); 

(2) 5t (3): X - and X + ; 
(4) & (5): X - 2cL and X + 2a_ ; 



usually called the Lower Control Limit (LCL) and the Upper 
Control Limit (UCL), respectively. They are often also referred 
to together as the 3-a limits. The X t 2cl. lines are often 



referred to as the 2-a limits, while the X i la_ lines are often 



referred to as the 1-a limits. It also seems to be customary to 
draw in the Center, the LCL and the UCL lines heavily and draw in 
the 1-a and 2-a lines either lightly or not at all. » 

Table 2 presents a data set that will a used to construct an 
example of an X-chart (as well as an R-chart). Each sample in 
the data set is composed of the measurements taken every half- 
hour of the voltage produced by four different supply units. If 
the reader prefers an educational examp3e, these data can be 
thought of as the number of words read per minute by four 



(6) & (7): X - 3^ and 
where = R/(d2v'n). The 



X + 3^ , 

- 3ar and the. X + 3a- lines are 
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.different individuals as measured on 25 consecutive Monday 
mornings. For these data, k=25, n=4,X= 349.808 and R = 
2.512 . From Table 1, d, = 2.059. Hence, a_ = 2.512/(2.059^/4") = 
.6100 . This yields an UCL- of 351.638, a value of 351.028 f or X 
+ 2^-, a value of 350.418 for X + a-, a Center Line of 349.808, a 

A 

value of 349.198 for 1 - o-, a value of 348.588 for X - 2o_, and 
a LCL- of 347.978 . The X-chart is given in the upper portion of 
Figure 1. 
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The R-chart is constructed in a manner similar to the X-chart. 
The points graphed are the ordered pairs (i, ) . The Center 
Line is given by R. The UCL is given by R + 3c^ , where ^ = 
JdjRj/dj, and the LCL is given by the maximum of {0, R - 3a„}. 
This definition of the LCL is advantageous since, by definition, 
the R^*s are always nonnegative. The other four lines are given 
by R + 2d^, R + la^, Max{0,R - and Max{0, R - 20^^}. For the 
data set presented in Table 2, R = 2.512 . From Table 1, d^ = 
2.059 and dj = .8798 . Hence = ( .8798*2. 512)/2. 059 = 
1.0734 . Thus, UCI^ = 5.7322, R + 2^ = 4.6583, R + = 3.5854, 
the Center Line = 2.5120. R - = 1.4386, R - 2a^ = .3052, and 
LCLj^ = 0» The bottom portion of Figure 1 gives the R-chart for 
these data. It should be noted that some authors (e.g., wheeler 
& Chambers, 1986) prefer to say that LCL_ "does not exist" when 

R 

the value of R - 3a-, < 0. 

R 
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Hypothesis Testing 

The null hypothesis that a process is stable is rejected if 
any of the following 12 conditions are met (Western Electric, 
1956, pp. 25-26). The first six conditions are: 

(i) One of more points on the x-chart are beyond the 3-a 

limits. That is, for some i, |X - X| > 3a-. 

' i • X 

(ii) Two out of three successive points on the X-chart are 
beyond the same 2-a limit. That is, for some i, at least two of 
{X^_2 , X^ j , X^} are less than X - 2a^ or at least two of {X^_2, 
X^_j , X^> are greater than X + 2a^. 

(iii) Four out of five successive poirts are beyond the same 
1-a limit. 

(iv) Eight successive points fall on the same side of the 
Centsr Line. 

(v) A run up (i.e., nondecreasing X's) or a run down (i.e., 
nonincreasing X's) occurs for seven out of eight successive 
points. 

(vi) An unusual or non-random pattern emerges for the X's. 
For example, if the observations were taken daily and Mondays 
were always high and Fridays were always low. 

Conditions (vii)-(xii) are analogous to conditions (i)-(vi) 
except that X is replaced by R throughout. 

The probabilities of violating conditions (i) through (iv) 
when is true (i.e., the probabilities of a Type I error) can 
be easily calculated when one assumes that the X^'s are i.i.d. 
normal, although none of the sources consulted for this paper 
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derived these probabilitie.'5 or even simply gave the probabilities 

without proof. For condition (i), the probability of a Type I 

error for any particular value of i is equal to P(j^ X| > 3^) 

= P(|2| > 3) = .00270 , where z is a standard normal random 

variable. For condition (ii), the probability of a Type I error 

for any particular i is equal to P(Two of {X^_2 , X^_j , X^} > X + 

2qj ) + P(Two of {X. , X , X } < X - 2a-) = (1) (P(z > 2))"+ 

(\yp(z < -2))2= 2(3P(z > 2^-= .0031054 . Similarly, for 

condition (iii) the probability of a Type I error for any 

particular i is equal to 2(^^J(P(z > 1))^) = .0063368 . Finally, 

for condition (iv) the probability of a Type I error for any 

particular i is equal to 2(P(z > 0)^) = .0078125 . The 

probabilities of a Type I error associated with conditions (v) 

and (vi) were not discussed in any of the references used for 

this paper, nor does this author know of a good way of estimating 

them. Under Hq, however, these errors logically seem to be rarer 

than those associated with conditions (i) to (iv). Hence, a 

rough approximation is available to the probability of one or 

more Type I errors occurring when the family consisting of 

conditions (i) to (vi) is applied. This approximation is given 
iv 

by a = 1 - (1 -P(Type I error associated with condition (j)). 
This approximation assumes that conditions (i) to (iv) are 
independent and ignores conditions (v) and (vi) . To the extent 
that conditions (i) to (iv) are not independent, the 
approximation overestimates a and to the extent that conditions 
(v) and/or (vi) hold, this approximation underestimates a. Using 
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this approximation, a value of • 0198151 is obtained. Hence, the 
probability of a Type I error using this set of conditions on the 
X-chart is approximately .02 . None of the references used 
discussed the probability of a Type I error associated with the 
application of conditions (vii) to (xii) to the R-chart. The 
derivation of the probabilities would be much more complicated 
than for conditions (i) to (vi) because the sampling distribution 
of the R's is positively skewed. Further, no mention was made in 
any of the references studied by the author of the power 
associated with this hypothesis testing procedure; but, this is 
not all that surprising since this procedure is rarely thought of 
as being a hypothesis testing method in Quality Control 
textbooks • 

Some Examples 

Returning to the data from Table 2 (and whose corresponding 

control charts are portrayed in Figure 1), it can determined from 

visual inspection of the 2^*s or of the X-chart that none of the 

conditions (i) to (v) is violated. Condition (vi) is 

subjective, but in this author* s opinion is not violated in this 

data set. Conditions (vii) to (xi) are also not violated here, 

as can be determined by visual inspection of either the Ranges in 

Table 2 or the R-chart. Condition (xii) is also subjective and 

in this author's opinion does not seem to be violated. Hence, 

since none of the 12 conditions is violated, H is not rejected 

o 

and it can be concluded that there is not yet enough evidence to 

15 
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conclude that the process is not stable, if the data from Table 
2 are thought of as reading rates for four children measured over 
a 25 week period, then it would be concluded that there is not 
yet enough evidence to conclude that the students' reading rates 
are unstable. 

As a second example, consider the data presented in Table 3. 
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These data are the means and ranges, for samples of size n = 2, 
for the weights of a particular product for k = 30 observation 
times. Similar to the above example, the reader can think of 
reading rates as the dependent variable instead of weights and 
the observations as being rates for two randomly selected 
passages read by a single individual at 30 different times. For 
these data, X = 278.3333, R = 31.333, d2 = 1.128 and dg = .8525 . 
Hence, = R/(d2^^) = 19.6418 and = d3R/d2 = 23.6805 . So, 
UCI^ = 337.2587, X + 26j = 317.6169, 1 + ^ = 297.9751, the 
Center Line = 278.3333,' X - % = 258.6915, X - 2a- = 239.0497, 
and LCL^ = 219.4079 . 

Condition (i) is not violated since none of the 30 Xj^'s are 
beyond the Upper or Lower control Limits. Condition (ii) is 
violated for i = 5, since the X^'s for both times 4 and 5 are 
below X - 2a_ . since, condition (ii) is violated, we can reject 
and conclude that the process is unstable. 

ERIC 1 G 
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For the purposes of this paper it was decided to continue to 
investigate violations of the conditions further in order to help 
the reader understand the 12 conditions better. Also, it is 
common practice in Statistical Process Control to investigate all 
12 conditions since each violation might have arisen from a 
different cause and the more causes that can be identified and 
corrected the better the final product will be. It is the 
author's opinion that this same argument holds for educational 
data; the main difference, however, is that causes are often 
harder to identify in educational settings than in industrial 
settings. But, perhaps with practice using control charts, 
educators will improve their abilities to identify causes, since 
the violations of the 12 conditions explicitly point out where 
and in what direction changes have occurred rather than simply 
indicating there has been some change. 

Returning to condition (ii), it is also violated for i=17, 

since the means for times 15 and 17 are both above X + 2o-. 

X 

Condition (iii) is violated for i=8 (since the means for times 
4,5,6, and 8 are below X - a-) and for i=9 (since the means for 
times 5,6,8 and 9 are below X - Conditions (iv) and (v) are 

A. 

not violated nor, by visual inspection, does condition (vi) seem 

to be violated. Turning to the R-chart, UCL = 102.3748, R + 2a 

R R 

= 78.6943, R + = 55.0138, the Center Line is R = 31.3333, R - 

= 7.7528, Max{0,R - 20^^} = 0, and LCL^ = 0. Since none of the 
R^'s are greater than 102.3748 (nor, of course, less than 0), 
condition (vii) is not violated, condition (viii) is violated for 



ERIC 



17 



15 

i-17, since the ranges for times 15 and 17 are both greater than 
78.6943. Condition (ix) is not violated. Condition (x) is 
violated for i=30, since the R^'s for times 23 through 30 are all 
below R. Condition (xi) is not violated and, by visual 
inspection, condition (xii) does not appear to be violated. 

Moving Range Charts 

In many educational, as well as industrial, settings only a 
single observation (X^) can be obtained on an individual or 
machine at each time point. The X- and R-charts can not be used 
directly in these cases. They can be modified, however, into a 
useful set of charts by introducing the concept of a moving 
* range. Some examples of situations where moving range charts are 
useful are: 

1. An industrial process where each unit takes a long time to 
produce. 

2. Any single-subject research (i.e., n - 1) design where the 
variable of interest is continuous (e.g., weight or time on 
task) . 

In most circumstances the moving ranges (denoted MR^'s) are based 
on sets of two consecutive observations, X^^j and X^, and are 
defined as the range of these two observations (i.e., MR^ = 

X^J). The individual X^'s then play the role of the X^'s 
and the MR^'s play the role of the R^'s from the X- and R-chart. 
The charts used in this situation are referred to as the X-chart 
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and the MR-chart. It is suggested practice (American Society for 
Testing and Materials, 1976) to take moving ranges based on 
exactly two observations. There is no difference in the 
statistical bases used to analyze these charts if three or more 
observations are used to compute the moving ranges. 

The use of X- and MR-charts can be illustrated by using the 
data from Table 3 and thinking of the "Weekly Averages" as 
reading rates for one student over 30 weeks based on a single 
passage (rather than two passages), since now n = 1, the X- and 
R-charts are not appropriate and the X- and MR-charts must be 
used. Table 4 presents the moving ranges (and for convenience 
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the X^»s). The mean moving range is denoted by MR and is equal 

to ( S (MR) )/(k-l). For this example, MR = 36.2069. By the 

i=2 i 

earlier discussion of the statistical underpinnings of the X- 
chart, = MR/d2, where the value of d^ is determined from Table 
1 using n = 2 (since each of the moving ranges is based on two 
observations). That is, d2= 1.128. Further, = where 
d^ is determined from Table 3 using n = 2. That is, d^ = .8525 . 
For this example, = 32.0983 and 27.3638 . Hence, for the 

X-chart, UCLj^ = 374.6282, X + 2c^ = 342.5299, X + a^^ = 310.4316, 
the Center Line = 278.3333, X - a., = 246.2350, X ~ 2a = 
214.1367, and LCL^^ = 182.0384 . For the MR-chart, MR + 3a^ = 
118.2983, MR + 20^^ = 90.9345, MR + G^^j^ = 63.5707, the Center 
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Line = 36. 2069, MR - = 8.8431, and the -2a^ limit and the LCh^ 
are both equal to zero. Figure 3 gives the X- and MR-charts for 
this example. 
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Applying the hypothesis testing procedure discussed earlier to 
these charts, it can be seen that conditions (i) and (ii) are 
not violated. Condition (iii) is violated when 1=8, since 
observations 4,5,6, and 8 are all below X - a^. Hence, H is 
rejected and it is concluded that the reading rates are not 
stable. Continuing to investigate the violations of the 12 
conditions (for the reasons cited previously), it can be seen 
that condition (iii) is violated also when i = 9, since 
observations 5,6,8, and 9 are all below X - a^. Conditions (iv) 
and (V) are not violated. By visual inspection, condition (vi) 
does not seem to be violated. Turning to the MR-chart, none of 
the conditions (vii) to (xi) are violated. Further, by visual 
inspection, condition (xii) does not seem to be violated. This 
example clearly shows the loss of power associated with the 
hypothesis testing procedure when only one observation, instead 
of even just two, is taken at each time point, since when the 
data were considered to have two observations per time point 
there were six violations of the conditions, while treating the 
data as having only one observation per time lead to only two 
violations. 
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This is an important point for educational researchers to 
note. Too often in single-subject research, researchers or 
others collect (or record) only one observation per time point, 
while it would be very little extra work in some cases to 
collect two or more observations. The reading rates example is a 
good example of this. The increased number of observations will 
lead to increased power for detecting a non-stable baseline and 
will thus save the researcher the time and money wasted by 
applying a treatment in a situation where the baseline is not 
stable. 

Charts Based on Counts 
In some cases the observations collected over time are 
integer-valued counts that follow approximately either a binomial 
or Poisson probability density function. The most common 
industrial situations where counts occur are when one is 
interested in the number of defective parts per lot (which 
follows a binomial density) or the number of defects per unit 
area, such as on a piece of material (which follows a Poisson 
density). Some educationally oriented variables that follow a 
binomial density are: 

1. Daily class attendance 

2. Number of students on task at any point in time 

3. Number of students getting each of a series of test items 
correct/incorrect. Here, the longitudinal dimension is the test 
items rather than time. 
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some educationally oriented variables that follow a poisson 
density are: 

1. Numiier of misbehaviors per day by an emotionally impaired 
student. 

2.. Numb'^.r of spelling errors per 100 words in an essay. 
The analyses of counts that follow either a binomial or Poisson 
density use the properties of the appropriate density to estimate 
the appropriate standard deviations. 

The Binomial Case 

For the binomial case, only one chart is constructed per data 
set. For both this case and the Poisson case, an unequal number 
of observations is allowed per time point, as opposed to the X- 
and R- charts and the X- and MR-c^arts cases where the sample 
sizes were required to be equal across time. When the sample 
sizes (denoted as the n^ 's, where i represents the time point 
and i=l to k, with k = number of time points) are unequal for the 
different time points the chart used is a p-chart. when the 
sample sizes are equal for the different time points either a p- 
chart or an np-chart is used (depending upon the preference of 
the researcher) . The details of constructing p- and -np-charts 
will now be described. 

The data collected at each time point can be thought of as 
arising from n trials whe^re each trial represents a single trial 
of a binomial probability experiment performed on an individual 
unit (e.g., a part or a student). Recall that a binomial 
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probability experiment has the following properties (Johnson, 
1988) : 

1. Each trial has two possible outcomes. These two outcomes 
are called "success" and "failure" 

2. P(Success) = p, P(Failure) = q, and p+q=l . 

3. The random variable represents the number of successes 
in the n^ trials. 

Next, define the variable p^ to be equal to X^/n^. Finally, 

— ^ ^ >_ _ 

define p = ( S X^)/( S n.) and q = l-p. Recall that the variance 
i=l ^ i=l ^ 

of p^ is given by pq/n^ . An estimator of this variance is given 
by pq/n^ . Since the p^ 's may be based on different n^'s and 
since the estimated variances of the p^'s are dependent on the 
n 's, the control limits will be dependent on the n^ 's. The UCLq 
is defined as p + 3,/pq/n^ , the 2-a upper limit as p + 2,^q/n^, 
the 1-a upper limit by f + ^/p^yn^ , the Center Line by p , the 1-a 
and 2-a lower limits by p - ^^q/n^ and p - 2v^q/n_^ and the LCLp^ 
is defined as p - 3^pq,/n± . As with the R-chart, if any of the 
formulas for the lower 1-a, 2-a, or control limits yield negative 
values, then a value of zero is used instead for these limits. 
Table 5 gives some data that are appropriate for this type of 
charting. Notice that for these data a success is defined as an 
"incomplete invoice" while a failure is defined as a "complete 
invoice". Although may seem strange at first, it is best for 
ease of interpretation to define a success as that value of the 
characteristic which is of most interest to the researcher. 
Figure 4 gives the p-chart and the UCLp's and LCLp's for the data 
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in Table 5. The 1-a and 2-a limits were not dravm in because 



INSERT TABLE 5 AND FIGURE 4 ABOUT HERE 



they make the chart overwhelming to look at. The hypothesis 

testing procedure dei=^<:ribed in the X- and R-chart section can be 

applied to p-charts by substituting the symbol "p" wherever the 

symbol "X" appears in Conditions (i) to (vi). Conditions (vii) 

to (xii) are, of course, ignored since there is only one chart 

here. For the data of Table 5 and portrayed in Figure 4, it can 

be seen that condition (i) is violated since for i=6, Pg ic above 

the UCLj . Kence, is rejected and it is concluded that the 
6 

process (i.e., the number of incomplete invoices) is unstable. 

When all the n^'s are the same, the p-chart becomes much 
easier to draw since the horizontal lines (e.g., UCLp^ and LCLp^ ) 
are the same for all values of i. when all of the n^'s are equal 
their common value is denoted by n. Further, when all the n^'s 
are equal, a chart called an np-chart is often drawn instead. 
The np-chart is almost identical to the p-chart. The only 
difference is that the vertical axis goes from 0 to 1 for the p- 
chart but from 0 to n for the np-chart. That is, the distinction 
between the np-chart and the p-chart is exactly the same as the 
distinction between a raw frequency histogram and a relative 
frequency histogram. 
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The Polsson Case 

As in the Binomial case, only one chart is constructed when 

the characteristic of interest follows a Poisson distribution. 

•^he chart is called a u~chart, where u is the symbol for the 

number of events (e.g., nonconformities, misbehaviors, misspelled 

words) per "unit". The "unit" is defined by the researcher. For 

example, if one is interested in misbehaviors, then the unit 

could be a day. If one were interested in the number of 

misspelled words in an essay, then a unit could be defined as a 

50-word block. At each time point data may be^ collected on some 

number of units, a^, where aj^ is some nonnegative real number (a^ 

need not be an integer). Let c^ denote the total number of 

events occurring at time i and let u^ = c^/a^ . Let u = 
k k 

ili^i^' ^ PoiBson density the appropriate estimated 

standard deviation of u^ is given by Ja/al . The UCL is given 
by u + 3^/u/a^ , the upper 2-a limit by u + 2/u/a^ , the Center Line 
by u, the LCL^^ by u - 3v/u/£^ ,etc. . The statistical testing 
methodology developed for the X'- and R-charts is then applied 
with slight modification by replacing "X" in conditions (i) to 
(vi) by "u" and by ignoring conditions (vii) to (xii). If all 
the a^'s are equal to one, then - ^ t and the symbol c is used 
throughout instead of c and the chart is known as a c-chart. 

It should be noted by the reader that there is a slight 
problem with the use of the 1-a, 2-a, and 3-a limits as defined 
above for the Binomial end Poisson cases. The problem is that 
the Binomial and Poisson densities are not symmetric. Hence^ 
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when symmetric limits are used the probability of a Type I error 
is a function of some unknovm population parameter. This 
parameter is p in the Binomial case and u (the population number 
of occurrences per unit) in the Poisson case. Some people prefer 
to use "probability limits" instead. The use of probability 
limits is discussed in Grant & Leavenworth (1988). 

Assessment of Treabmtint Eff ^^ cts 
For any of the above cases (i.e., X's, Moving Ranges, or 
Counts), the effects of some treatment can be assessed once a 
stable baseline has been established. To do this, the researcher 
computes the UCL, 2-a and l-a upper limits, the Center Line, the 
1-a and 2-a lower limits, and LCL based on the baseline data. 
The treatment is then applied. Additional measurements are then 
taken either during or after treatment (depending on the 
treatment). The hypothesis testing procedures discussed above 
for the various cases are then applied to the new measurements 
using the UCL, Center Line, LCL, etc. computed from the baseline 
data. If any of the conditions (i) to (xii) is violated, it can 
then be concluded that there is a treatment effect. It is 
realized by the author that this procedure is a conditional 
procedure based on the correct decision being made about the 
baseline. That is, if a Type II error occurs (i.e., the baseline 
is truly unstable* but the researcher wrongly concluded it was 
stable) when testing the baseline data, then conclusions made 
about treatment effects may not be correct since the baseline was 
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unstable, it should be noted, however, that this procedure is 
still better than visual inspection, which is the usual method 
used by many researchers. Further, the procedures described in 
this paper have the advantage over other procedures suggested in 
the literature for the analysis of single-subject data that they 
are understandable by non-mathematically oriented researchers. 
The other procedures often i . commended in the statistically- 
oriented literature on single-subject design (e.g., time series 
methods and randomization tests) are not easily understood by 
these researchers. 

Some Final Notes 

1. All of the SPC techniques introduced in this paper can be 
modified to allow for the analysis of data from one-way multiple 
group designs and factorial designs. 

2. This paper does not contain all of the control charting 
techniques used in Statistical Process Control. It does, 
however, cover the ones that are used heavily in Statistical 
Process Control. Fortunately, these are the same techniques that 
.are^-usef ul In educational settings . There are also Statistical 
Process Control techniques (such as Process Capability studies) 
that are not based on control charting, but the author has yet to 
come up with good educational applications of these techniques. 

3. As the author was finishing this paper she became aware of a 
new textboc'- in statistical process control by Ryan (1S89). 
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Table 1 

Values of Relative Efficiency, d^^ cvr and d3 



n 


Relative Eff. 


da. 


C y 


d5 


2 


1. 000 


1.128 


.7979 


. 8525 


3 


.992 


1.693 


.8862 


.8884 


4 


.975 


2.059 


.9213 


.8798 


5 


.955 


2.326 


.9400 


.8641 


6 


.930 


2.534 


.9515 


.8480 


7 




2.704 


.9594 


.8332 


8 




2.847 


.9650 


.8198 


9 




2.970 


.9693 


.8078 


10 


,850 


3.078 


.9727 


.7971 



Note: The values for relative efficiency for n = 7, 8, and 9 

were not reported in Montgomery (1985). The values for d 
are from Grant and Leavenworth (1988). 
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Table 2 
Voltage Data 





DC-voftage output at 20 mA 


1 

1 




Subgroup 




Sample letter 




















number 


a 


h 


c 


(/ 


Average X 


Range R 


1 


•348.5 


350.2 


348.3 


350.3 


349.3 


2.0 


2 


351.3 


351.2 


347.1 


349.7 


349.8 


4.2 


3 


348.5 


350.5 


348.5 


349.0 


349.1 


2.0 


4 


351.4 


350.4 


348.6 


353.2 


350.9 


4.6 


5 


349.4 


348.0 


349.6 


351.1 


349.5 


3.1 


O 


351.1 


348.1 


349.2 


350.1 


349.7 


3.0 


7 


348.3 


349.9 


350.7 


348.5 


349.4 


2.4 


8 


349.9 


349.1 


349.0 


349.6 


349.4 


0.9 


9 


- 349.2 


34S.7 


348.8 


350.3 


349.3 


1.6 


10 


349.2 


351.6 


O CI C\ 

351.9 


349.2 


350.5 


2.7 


11 


350.1 


350.5 


35J.2 . 


. 347.9 . 


349.9 


3.3 




350.4 


350.8 


350.3 


352.6 




Z.J 


13 


347.7 


349.6 


348.6 


349.3 


348.8 


1.9 


14 


349.0 


351.1 


350.2 


348.0 


349.6 ■ 


3.1 


15 


350.7 


349.3 


349.3 


350.2 


349.9 


1.4 


16 


350.0 


351.8 


352.3 


349.8 


351.0 


2.5 


17 


350.1 


349.8 - 


349.6 


349.2 


349.7 


0.9 


18 


351.1 


350.6 


346.9 


349.8 


. 349.6 


4.2 


19 


351.4 


349.3 


349.7 


349.6 


350.0 


2.1 


20 


348.8 


349.6 


„35L3_ 


.- .349.2 . 


.349.7 . 


2.5 


21 


349.4 


350.2 


350.2 


351.8 


550.4 


2.4 


22 


351.7 


351.6 


349.9 


347.1 


350.1 


4.6 


23 


350.4 


349.0 


349.2 


349.6 


349.6 


1.4 


24 


349.4 


348.7 


350.3 


348.8 


• 349.3 


1.6 


' 25 


349.6 


349.1 


349.6 


351.2 


349.9 


2.1 


2 










8.745.2 


62.8 



Note: Data taken without permission from Grant and Leavenworth 
(1988). 
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Table 3 
Weight Data 



Subgroup 

lfo» Average Range 



1 


255 


10 


2 


330 


20 


3 


280 


100 


4 


235 


10 


5 


230 


40 


6 


240 


0 


7 


280 


20 


8 


235 


50 


9 


240 


20 


10 


315 


30 


11 


325 


10 


12 


280 


60 


13 


260 


20 


14 


'275 


50 


15 


330 


100 


16 


250 


40 


17 


320 


80 


18 


260 


60 


19 


275 


30 


20 


295 


30 


21 


225 


30 


22 


300 


40 


23 


330 


0 


24 


275 


10 


25 


290 


20 


26 


295 


10 


'27 


265 


10 


28 


280 


20 


29 


285 


10 


30 


295 


10 



Note: Data taken without permission from Wheeler and Chambers 
(1986). 
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Table 4 

Moving Ranges for Weight Data 



Subgroup No. 



MR 



1 
2 
3 
4 
5 
6 
7 
8 
9 

10 

11 

12 

13 

14 

15 

16 

17 

18 

19 

20 

21 

22 

23 

24 

25 

26] 

27 I 

28 

29 

30 



255 
330 
280 
235 
230 
240 
280 
235 
240 
315 
325 
280 
260 
275 
330 
250 
320 
260 
275 
295 
225 
300 
330 
275 1 
290 I 
~295 j 
265 1 
280 • 
285 
295 



75 
50 
45 

5 
10 
40 
45 

5 
75 
10 
45 
20 
15 
5.5 
80 
70 
60 
15 
20 
70 
75 
30 
55 
15 

5 
30 
15 

5 
10 
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Table 5 
Invoice Data 



DAILY PROPORTIONS FOR INCOMPLETE INVOICES 





Number of 


Total 






Number of 


Total 




Date 


Incomplete 


Number of 




Date 


Incomplete 


Number of 






Invoices 


Invoices 


Pi 




Invoices 


Invoices 


Pi 


9/27 


20 


98 


,204 


10/11 


7 


50 


,140 


9/28 


18 


104 


,173 


10/12 


7 


53 


,132 


9/29 


li 


97 


,144 


10/13 


9 


56 


,161 


9/30 


16 


99 


,162 


10/14 


5 


49 


,102 


10/1 


13 


97 


,134 


10/15 


8 


56 


,143 


10/4 


29 


102 • 


,284 


10/18 


9 


53 


,170 


10/5 


21 


104 


,202 


10/19 


9 


52 


,173 


10/6 


14 


101 


,139 


10/20 


10 


51 


,196 


10/7 


6 


55 


,109 


10/21 


9 


52 


,173 


10/8 


6 


48 


,125 


10/22 


10 


47 


,213 



Note: Data taken without permission from Wheeler and Chambers 
(1986) . 
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MR-chart 



n^;?n^^^./".^'^ MR-charts for the Weight data when each time 
point is considered to have one observation 



(D 



I 

& 

01 
>1 
ft 

Ml 
O 
>1 

tr 

(D 

H 

< 
O 
H- 
O 
CD 

P- 
P» 
ft 
P) 



0.300 - 



0.200 - 




0.100 - 



0.000 - 
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